Aralex: A lexical database for Modern Standard Arabic
نویسندگان
چکیده
منابع مشابه
Aralex: a lexical database for Modern Standard Arabic.
In this article, we present a new lexical database for Modern Standard Arabic: Aralex. Based on a contemporary text corpus of 40 million words, Aralex provides information about (1) the token frequencies of roots and word patterns, (2) the type frequency, or family size, of roots and word patterns, and (3) the frequency of bigrams, trigrams in orthographic forms, roots, and word patterns. Arale...
متن کاملAralex: A lexical database for Modern Standard Arabic
Psycholinguistic databases, providing statistical information such as word frequency, length, and imageability, have proved to be invaluable tools for the experimental investigation of the cognitive processes underlying language functions and for the design of language assessment tools for both educational and clinical purposes (Lété, Sprenger-Charolles, & Colé, 2004; Stadthagen-Gonzalez & Davi...
متن کاملA Lexical Database for Modern Standard Arabic Interoperable with a Finite State Morphological Transducer
Current Arabic lexicons, whether computational or otherwise, make no distinction between entries from Modern Standard Arabic (MSA) and Classical Arabic (CA), and tend to include obsolete words that are not attested in current usage. We address this problem by building a large-scale, corpus-based lexical database that is representative of MSA. We use an MSA corpus of 1,089,111,204 words, a pre-a...
متن کاملBuilding A Modern Standard Arabic Corpus
Language Engineering, including Information Retrieval, Machine Translation and other Natural Language-related disciplines, is showing in recent years more interest in the Arabic language. Suitable resources for Arabic are becoming a vital necessity for the progress of this research. Until recently, only two Arabic corpora were commonly available for researchers: the AFP Arabic newswire from LDC...
متن کاملCUNIT: A Semantic Role Labeling System for Modern Standard Arabic
In this paper, we present a system for Arabic semantic role labeling (SRL) based on SVMs and standard features. The system is evaluated on the released SEMEVAL 2007 development and test data. The results show an Fβ=1 score of 94.06 on argument boundary detection and an overall Fβ=1 score of 81.43 on the complete semantic role labeling task using gold parse trees.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Behavior Research Methods
سال: 2010
ISSN: 1554-351X,1554-3528
DOI: 10.3758/brm.42.2.481